Intersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition
نویسندگان
چکیده
The total variability factor space in speaker verification system architecture based on Factor Analysis (FA) has greatly improved speaker recognition performances. Carrying out channel compensation in a low dimensional total factor space, rather than in the GMM supervector space, allows for the application of new techniques. We propose here new intersession compensation and scoring methods. Furthermore, this new approach contributes to a better understanding of the session variability characteristics in the total factor space.
منابع مشابه
Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification
This paper presents a new speaker verification system architecture based on Joint Factor Analysis (JFA) as feature extractor. In this modeling, the JFA is used to define a new low-dimensional space named the total variability factor space, instead of both channel and speaker variability spaces for the classical JFA. The main contribution in this approach, is the use of the cosine kernel in the ...
متن کاملExploring Hilbert envelope based acoustic features in i-vector speaker verification using HT-PLDA
In this study we evaluate the effectiveness of our recently introduced Mean Hilbert Envelope Coefficients (MHEC) in i-vector speaker verification using heavy-tailed probabilistic linear discriminant analysis (HT-PLDA) as the compensation/backend framework. The i-vectors are estimated for MHECs, and also the conventional and widely used MFCCs for comparison. The linear discriminant analysis (LDA...
متن کاملI-vector based speaker recognition using advanced channel compensation techniques
This paper investigates advanced channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora. The performance of four channel compensation techniques: (a) weighted maximum margin criterion (WMMC), (b) source-normalized WMMC (SN-WMMC), (c) weighted linear discrimin...
متن کاملSpeaker Comparison with Inner Product Discriminant Functions
Speaker comparison, the process of finding the speaker similarity between two speech signals, occupies a central role in a variety of applications—speaker verification, clustering, and identification. Speaker comparison can be placed in a geometric framework by casting the problem as a model comparison process. For a given speech signal, feature vectors are produced and used to adapt a Gaussian...
متن کاملProsodic speaker verification using subspace multinomial models with intersession compensation
We propose a novel approach to modeling prosodic features. Inspired by Joint Factor Analysis model (JFA), our model is based on the same idea of introducing subspace of model parameters. However, the underlying Gaussian Mixture distribution of JFA is replaced by multinomial distribution to model sequences of discrete units rather than continuous features. In this work, we use the subspace model...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011